CDS

Accession Number TCMCG060C20654
gbkey CDS
Protein Id XP_015572789.1
Location join(1800..2168,3181..3407,4209..4276,4516..4605,4964..5103,5277..5345,5426..5514,6225..8387,8522..8599,8703..8814,10204..10254,10361..10411)
Gene LOC8274541
GeneID 8274541
Organism Ricinus communis

Protein

Length 1168aa
Molecule type protein
Topology linear
Data_file_division PLN
dblink BioProject:PRJNA34677
db_source XM_015717303.2
Definition dentin sialophosphoprotein [Ricinus communis]

EGGNOG-MAPPER Annotation

COG_category S
Description Occludin homology domain
KEGG_TC -
KEGG_Module -
KEGG_Reaction -
KEGG_rclass -
BRITE ko00000        [VIEW IN KEGG]
ko04121        [VIEW IN KEGG]
KEGG_ko ko:K11807        [VIEW IN KEGG]
EC -
KEGG_Pathway -
GOs -

Sequence

CDS:  
ATGTACGGTGGTTCCTCCAAGTTGGGCGGCGGCCGTGGCGGCGGAGGTGGCCGGGGAGGGGGGCGTCTATCTTCGTTCCCTCCGCCGCCGCCTCACCGGTCATCCACATCCAATAAAAACTCCCGCCTCTCCCTCGGAGGTGGCGGCGGTTCCAATCCTCGGAGCCGCTCAGGTGGTACCTCGAGTGCCACAGCAGCGGCAGCAGCAGTTGAAGAGACGTTTTCCCTTATTCCTGGGAAAAACCCACTTGCTTTTGCAATGATTATTAGATTAGCACCTGATTTGGTTGATGAGATCAGAAAAATTGAAGCTCAAGGCGGTTCTGCTAAAATCAAATTTGATTCTATTGGCAGTAATAATTTTGGAAATGTCATTGATGCGGGAGGTAAGGAATTCAGATTTACATGGTCGAGGGAATTTGGTGACCTCTGTGACATTTATGAAGAACGTCAAAGTGGTGAAGATGGAAACGGTGTGCTTGTTGAATCTGGGTCTGCCTGGCGTAAGGTGAATGTCCAACGTGTCTTAGATGAATCTACTAAGAACCATGTAAAGAAGCTATCAGAGGAAGCTGAACGCAAAAATAAATCACGCAAAGCTATTGTGCTAGATCAGGGGAACCCATCAATGAAGAATCAACTAAAGCAGTTGGCACTTGCCGAGTCTACACCATGGAGGATGTCCTTTAAGCGAAAAGAGCCTCCATATAAAAAGCAGAAAGTTGAACCACCACCAGGTACATCACGGAATTACAAAAAAGAAATCCAGACCAAGGCTTCTAGTGCTGTACAGGAGACAACAGGGCACAAAGGGAACTTTGGGACTAAACCAATGGATTTGCGGAGCATGTTGATCACTCTACTGATTGAAAATCCTAAAGGAATGAGCTTGAAGGCCTTGGAGAAAGCTATTGGGGATAAATTACCGAACTCTGTTAAAAAGATAGAGCCCATTATTAAAAAAATTGCGACTTTCCAAGCTCCAGGAAGATATTTCTTGAAACCAGGAGTGGAGTTGGAAAGCTTCAAGAAACCTTCATCTGAGAGTGGAAGTTCTCCTGAGGACAACCATCAGCAGACACTTGTTCCTGAAGACAACCATGACAACACACCTGCTCTTGAATCAAGACCTGCTGAGAAAAGTCCTGCTGTCAGATTTGAGGAACATGCCCAAATAAAATCTAAATTTGAAGAAGAGTCGAATGCCTTAGAAAAAATTGATGTCCACGAGAAAAAAATCTCCGACAATAGTGAGGGGCAGGCAAGCTCTAGTGAAAGTGGAAGTGACAGTGATAGTGAAAGTGACAGTAGTGACAGTGGGAGTGATAGTGGAAGCCGCAGCAGGAGCAGGAGCAGAAGCAGAAGCAGAAGCAGAAGCCCGGTAGGGAGTGGAAGTGGGAGTAGCAGTGACAGTGAAAGTGATGCTTCATCCAACAGTAAAGAAGGATCTGATGAGGATGTGGATATTTTAAGTGATGATGACAAAGAACCCCAGCACAAGTTACAGGCCTCTGAACCACGTTTCACAGCATCTCCTGATCCATGGAGATCTGTGCAGAATGGGACAGATGAGAAGCAAGATGGCGATGGATCTGATGCAGTTGACATTGATGGTCCTGGATCTGCTGGACCTTATGGCGAAGGTCATGAATCTGAAGCTGTTGACATTGAGAAAGATTTGGCTAATCATGAGAAGGTAGTTGAATTAGCTGCAAATGACAGTTTGCTTCCCACCCAAGAAGTTGACATACATGTGGAAGGAGCTCAATCCATTATCACTGATCATGATGCCATCCAAGAGCGTCAAAATTTCATAGGAACTCTGTTTGATGATAATGAAAATATGGTTAGGGACAGCTTCAGGCATGAGAATTCTGACAGTTCTGAGAGGATATCCAAAAGTAAGTCCAAAAGGGTTCCTGATATGAAGCACTTTGATGAGATATCTGACACTGCTAAAAGATTGAAAGTTGACAGTGTGGCTCAACCACCCATTTCTGAGGTTAGAGATGTCCAATTACCTGAGAGCCCTTACAACAAAAATATTGAAGACACTTTTAGGGGCCCTGCTATTCAAGCGATGAACAGGGCTGATAGGGAAGGAAATGCAGATTTTGGCTCACATAAGGCATTTAATGTACAACCTAGTTCAGATTTTCAGCAACCTGACCGAAGGTCTTCTGATAAAATTGCACGCTCAAAAGCTTCTGATTCAACCGGGAGATCTAAACATACTGAGAGATCAGGGCATGGTCGTAAATTTTCTTCAAAGGATTCTCATTTGCATGAAGATTTTCCTATCCAAAGAGAAAAGGCTTCTAGAGACACTCTAAATGAAGGTAATTCTTCGAAGGACAAAAAGGTGCCAAGAAACTCCAAGGGAGGTGGAGCTGGAGGCAGACATTCAACTTCCTTTGATTCTGACTATAGGAAACTGGGTGAGACGTATGGGAACTTCAAGGATGCTGCACAACCTTCACCTAAGGATGGCAACAGAGTTGATGTAGAAAAATACCCAGCTGTCAGTGGAAGAAGCCTCCAAAGAGAGCTTTCAGAGCTGGAGTTAGGAGAATTTCGCGAGCCATTGCTTGAGGATAAACCAGTTAAGAAACAATTTGACAGGAAGGGCCCTTTTAAACAGTCAGAGAGCAAACCAAGCACTTCAGATAACTGTAACTCAGATTTTAATAAGGCAAAACCTGCTGGAAAGGCAACATTAGACTCAGGAAAGCCTTCTCCTTCCAATCTAGGTACTGGGTTTAAGAGAACCCCTGATCATCATACTGAAGATGTAACAAGATCTCACCTTAAGGTTGCACAATCTCATCTACAACATCTTTCAAGGCTAGATAATGCTGAAGTTGGATCTCACTTCAGCAAGTTGGCAGATACGAATAGTAGATTAAGACAGAATGAAGCTGGGGCAAAACTAGGAAACAGTATAGAAGGCTATGGAGAAAACCATAAAAGAGGCCCTAGCAATGCACAACCACTGCATGAGTCTAAACGTGGATTGCTTTCCAACTCGATTAAGGAAAGTAAAACGCAAACATCTAATAGAATTCCTGACTTGGTAGATGGACAAAAGGAAACAGTTGTGACTGAAGCCAATAATGCTCGAAAGAGGAGAGAATCTTTGTCCGAGGAAGATGGCTCTTTTTCGAAGTATGTAAAGGACACACCCGAGCTCAAAGGACCAATTAAGGATTTTTCTCAGTACAAAGAATATGTGCAGGAATACCGCGATAAGTATGATAGTTACTGTGCCTTGAACAAGATCCTAGAAACCTACAGGAATGATTTCCATAGACTGGGAAAGGACCTTGAATTTGCAAAAGGCAGGGATATGGACAAATATCATAAGATCTTGGTGCAGTTGCAGGAATCTTATCTTCAGTGTGGACCGAGGCACAAACGGTTCAAAAAGATATTTGTTGTGCTGCATGAAGAACTAAAGAACCTAAAGCAAAGAATTAAAGAATATGCAGTCACTTGTACAAAAGACTGA
Protein:  
MYGGSSKLGGGRGGGGGRGGGRLSSFPPPPPHRSSTSNKNSRLSLGGGGGSNPRSRSGGTSSATAAAAAVEETFSLIPGKNPLAFAMIIRLAPDLVDEIRKIEAQGGSAKIKFDSIGSNNFGNVIDAGGKEFRFTWSREFGDLCDIYEERQSGEDGNGVLVESGSAWRKVNVQRVLDESTKNHVKKLSEEAERKNKSRKAIVLDQGNPSMKNQLKQLALAESTPWRMSFKRKEPPYKKQKVEPPPGTSRNYKKEIQTKASSAVQETTGHKGNFGTKPMDLRSMLITLLIENPKGMSLKALEKAIGDKLPNSVKKIEPIIKKIATFQAPGRYFLKPGVELESFKKPSSESGSSPEDNHQQTLVPEDNHDNTPALESRPAEKSPAVRFEEHAQIKSKFEEESNALEKIDVHEKKISDNSEGQASSSESGSDSDSESDSSDSGSDSGSRSRSRSRSRSRSRSPVGSGSGSSSDSESDASSNSKEGSDEDVDILSDDDKEPQHKLQASEPRFTASPDPWRSVQNGTDEKQDGDGSDAVDIDGPGSAGPYGEGHESEAVDIEKDLANHEKVVELAANDSLLPTQEVDIHVEGAQSIITDHDAIQERQNFIGTLFDDNENMVRDSFRHENSDSSERISKSKSKRVPDMKHFDEISDTAKRLKVDSVAQPPISEVRDVQLPESPYNKNIEDTFRGPAIQAMNRADREGNADFGSHKAFNVQPSSDFQQPDRRSSDKIARSKASDSTGRSKHTERSGHGRKFSSKDSHLHEDFPIQREKASRDTLNEGNSSKDKKVPRNSKGGGAGGRHSTSFDSDYRKLGETYGNFKDAAQPSPKDGNRVDVEKYPAVSGRSLQRELSELELGEFREPLLEDKPVKKQFDRKGPFKQSESKPSTSDNCNSDFNKAKPAGKATLDSGKPSPSNLGTGFKRTPDHHTEDVTRSHLKVAQSHLQHLSRLDNAEVGSHFSKLADTNSRLRQNEAGAKLGNSIEGYGENHKRGPSNAQPLHESKRGLLSNSIKESKTQTSNRIPDLVDGQKETVVTEANNARKRRESLSEEDGSFSKYVKDTPELKGPIKDFSQYKEYVQEYRDKYDSYCALNKILETYRNDFHRLGKDLEFAKGRDMDKYHKILVQLQESYLQCGPRHKRFKKIFVVLHEELKNLKQRIKEYAVTCTKD